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Mathematical Contributions to the Theory of Evolution. 


XVI. ON FURTHER METHODS OF DETERMINING CORRELATION. 
By Kart Pearson, F.R.S. 


(1) Introductory. The object of the present paper is to give an account of some 
new methods of determining correlation. It is not suggested that they can with 
advantage replace the old processes, even when the distribution is approximately 
normal; to my mind the methods of determining the correlation ratio and the 
correlation coefficient (y and 7) based on moments and product moments stand fore- 
most for the information they give and its weighable accuracy. At the same time 
there are series which are so short, or cases in which it is desirable to come rapidly 
to an approximate result or data which cannot be presented in a form suitable for 
product-moment working, where other methods are not only reasonable, but necessary. 
To such cases the present new methods apply. I have termed them new methods 
and I think this is legitimate. In the case of the first method, I have not seen any 
hint of it before. In the case of what I term grade methods, Dr Spearman has 
suggested that rank in a series should be the character correlated, but he has not 
taken this rank correlation as merely the stepping stone by which to reach the true 
correlation of the variables as dependent magnitudes, and further in the discussion 
he has given of the subject he has, I believe, given erroneous formulae and made 
quite incorrect statements as to the magnitude of probable errors. 

One word must be said as to the use made of the normal distribution. I have 
used it here as on many other occasions as a means of suggesting fitting relations 
and simple formulae for correlation constants. This does not necessarily mean (i) that 
the constants reached may not have a perfectly definite meaning apart from normal 
distributions, or (ii) that the formulae obtained may not hold for all forms of 
distribution apart from normality. As an illustration of the first case I cite my 
mean square coefficient of contingency*. This is a perfectly general measure of the 
deviation from independent probability in the case of an 1x m fold table, but its 

* On the Theory of Contingency. ‘Drapers’ Research Memoirs, Biometric Series 1” (Dulau & Co., 
Soho Square, London, 
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actual form was selected so that it would agree with the coefficient of correlation in 
the case of indefinitely fine grouping and normal distribution. As an illustration 
of my second point I take the formulae given by me for the influence of selection on 
variation and correlation*. These formulae were originally proved for normal dis- 
tributions, but for a number of years past the proofs given in my lectures have 
been perfectly general, depending only on a more comprehensive definition of what 
we are to understand as correlation in the case of a complex of variables. 

These points will be considered in the present treatment of correlation. 


(2) On Difference Methods of finding the Coefficient of Correlation. 


Let « and y be two correlated variables, each measured from their means m, and 

m, respectively. Then if v=a—y, and o,, o,, 7, denote the three standard deviations 
Cy =O, + Oy 2,0 on 
and Taj = (Og +04 =O) 20307) a ee are eee (i). 

This method of finding 7,, has long been in use as an alternative method to the 
product-moment method f. 

It involves finding the mean-square difference of the values of the pairs of 
correlated characters. It is possible, however, to find r,, from about one half 
these differences, if we assume the distribution to be normal. . 

More generally I proceed as follows. Suppose the function ma —ny formed, where 
mand n are at present indeterminate positive constants, and let the positive values 
only of this expression be taken and divided by the total frequency N. Then it will 
be possible to determine 7 from this result. 

If z be the ordinate of the surface, then : 

eee are =a ( 
270710» J1—r 


and we have the above result expressed analytically: 


S (ma — 1 1 ly each fee sh 
(ma — ny) ‘Z [[(x —ny)e 1-7 (5 Sa -) dyda ...(ii1). 


a* ray re r) 
= 2 


Cr Ono, en Oy 


NN 13 270,0,/1 —7 

The limits to y in order that mx—ny may be positive are y= —o to ma/n, and 
the limits of x will then be r= a0 to —oo. 

Put Gia Lice 
then 

mos 
S(ma—ny) 1 1 +o Noy a4 = fa ee f say 
at gr er ene = [. (mo,a —no,y’) x e tie aes ae. dy dee. 


* Phil. Trans. A, Vol. 200, pp. 1—66, 
+ For example, Phil. Trans. A, Vol. 198, p. 242, and often elsewhere. 
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But the integral with regard to 2’ is /27 8, and 


n 1 he 
—4 : 
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0 


S(mex—ny) 1 


Hence : W <= ee = Qa Qa Ba’. 


Or, for the positive summation 


S(ma—ny) _ no; (1-7) J/o2—2rmno,o,+ mo; an 
(O89 Woe —2rmo,(no,—ma,)—7r Map 


This general value does not appear to be likely to be of much service. If we take 


m=n=1, we obtain the result of simply summing the positive differences of paired 
variates. It is: 


S(a-y)_ of (1-1)Vo3-2ro,0,+0; (v) 
Bee (seg oar ter ‘ 
(v) leads to an equation of the 5th order to find 7 and again does not appear to 
be likely to be of any service. The variates must be reduced to a common unit before 


they are handled if we are to make (iv) workable. Such a unit is the standard 
deviation. 


; i if 
If we write m=— n=—, we have at once: 
O71 CO, 


Thus we find: fe a Gi % 


6 KARL PEARSON 


(vi) is an extremely neat formula and might be taken as the definition of a quantity 
measuring correlation. But the actual determination of correlation in this way, 
z.e. the reduction of each variate to a deviation from its mean measured in terms 
of its 8.D. as unit, would probably be as troublesome as using the product-moment 
method. 

One special case occurs, however, in which the above formula may possibly be of 
good service. Suppose the two variates have the same mean =m and the same 
8.D.=o, then: : 


RQU/ze aes 2 z a 2 

ahs. eee eas y= mei Oe (vii). 
Or, the coefficient of correlation is the result of subtracting from unity 7 times the 
square of the mean sum of the positive differences of paired variates divided by their 
common standard deviation. 

For cases in which both variates are the same, brothers, cousins of the same sex, 
homotypes, etc., and especially for some cases of short series, the method may be 
of value. 


Illustration I. Resemblance of Length of Little Finger in Male Cousins. I take 
a short series of 68 male pairs of cousins. The average value of the length measured 
on the little finger was 51°02 mm. and its standard deviation 2°721 mm. There were 
33 positive differences of finger length giving S(«—y)=87°6 mm. Hence we had: 


pif ap $8188)" gg. 


7°4036 


Found by the product-moment method the answer was ‘287; the difference is 
well within the probable error of the latter value. The process of taking differences 
and summing was considerably shorter than finding a product moment. 


Illustration IT.  Assortative Mating in the case of Paramecium. I take 
Dr Pearl’s Table AA3 from Vol. v. p. 295 of Biometrika for the lengths of 
conjugating Paramecia. 

I choose this purposely because there was no difticulty above about the male 
cousins ; there were only two equalities, the actual measurements of each individual 
being recorded. But in an ordinary correlation table owing to the method of grouping 
there will be a very considerable number of ties, and the problem arises how are they 
to be distributed. Clearly one half of them will be excesses and one half defects, 
if we suppose the odds against an actual tie in measuring to any degree of accuracy 
to be very great. Hence we may say that half the diagonal total is to be treated as 
in excess. But at what portion of the base unit are we to set the pair apart? If the 
frequency was uniformly distributed over the diagonal cells, we should take the average 
interval between a pair to be 4 the base unit. But the material is almost always 
clustered inside the cell, and clearly $ is too much. The actual value to be taken would 
depend upon the value of the correlation and the size of the base unit. In fact we 
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can only take a rough approximation. 


fairly well. 
by the base unit. 


I suggest that 4 will be found to work 
Accordingly we take 4 of the contents of the diagonal cells, multiplied 


The whole process may now be written as follows : 


& 1 2 3 rie 6 7 8 Cee ton) ott #12 
0 1 1 1 0 [1 0 = = = és #2 se 
ee re ewe Oli a a 
eee) el CPO ge oe 
Beets een ee tee er 08 ae 
mse eh ee le Os 
oem DS 00 Oe 
LOG Lo 7 1 2 0 1 16°3 
16 4 2 0 0 0 2 85 
4 5 3 0 0 4 76 
OOO 02) 4, b4 
avisio16 < OL 18 16 
OF Ol. 88 20 
0 85 12 
98 o=19112*, — S(w—y) = 2793 x 10 
2793 = 
pees Ae ne 
Sas (ai x Baie : 
The value obtained by the product-moment method is ‘588 + ‘022. 
The correlation Table is as follows : 
Length of First Conjugant. 
160-9 | 170-9 | 180-9 | 190-9 200-9 | 210-9 | 220-9 | 230-9 | 240-9 | 250-9 | 260-9 | 270-9 280-9 | Totals 
~ : 2 y 
Sf1G0-9 (ee Whe” Ped d Y de Lh) — Se ne 
SUPA T09 (hicbecd. De.-h iy artes) acetate oy | mt oe : 5 
Papier Otte atone ds hrs Siete yp oe | Py 15 
3S 190-9 1 — 3 4 14 7 5 1 1 — —_— — | — | 36 
200-9 | — | — 4 14 30 25 9 5 1 i — | — —- | 89 
5S |) 210-9 I — 1 7 25 22 16 5 5 -— —- a 82 
3 220-9 | — 1 — 5 y 16 10 16 7 1 2 — 1 68 
Sees C ot = oe 1 5 5 16 16 4 2 — | — 49 
“| 240-9; — | — |} 1 1 1 5 7 4 54 5 3) — | — | 31 
= 250-9 ;} — | 1 _ 1 2 5 4 — — | — /| 13 
en ee Ne eee ame ee fot | | 7 
S | 270-9 E mete Sa ee | Seat SILLA jh Q 
= 280-9 | — | — | — | — |] — _ 1 — | — | — | — — | — 1 
L 
N a ee ee : | 
Totals 4 5 15 36 | 89 82 68 49 31 LG || cath 0 1 400 


* Pearl, loc. 


cit. p. 226, Table II. 
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we proceed thus: Read each column down to and including the diagonal cell, and 
place the total under the corresponding differences in the previous scheme. For 
example, take the sixth column; 1, 0, 1, 7, 25, 22, are the corresponding frequencies, 
and these numbers will be found, sloping from the column marked 5, i.e. difference 
5 x 10, diagonally across the scheme. In this manner the columns of the table can be 
disposed in the scheme at once. The scheme columns are then added up and 
multiplied by the difference at the top, and, if multiplied again by the base unit, 
in this case 10, the total gives S(a—y). The whole can be done with very great 
rapidity, and the correlation found in about 10 minutes if 7 be known, 
As other comparisons I give the homotypic results : 


Difference method Product method 


Monmouthshire Ashes (65,000) 432 405 +011 
Papaver Rhoeas (Quantocks) (19,790) 523 533 +:013 
Ditto (Chilterns’ Base) (25,160) "395 ‘400 + 012 


These results show that there exists quite a-reasonable amount of agreement between 
the two methods, and the difference method is much the shorter when the table 
contains thousands of observations as in these cases. At the same time too much 
reliance must not be placed upon the difference method, not only because it assumes 
normality of distribution but because it involves a somewhat rough method of 
approximation in the case of the diagonal cell. 

One further point may be noted. Suppose that rank in a series was a true 
character which could be dealt with by a difference formula like the above then r the 
correlation of the ranks would be given by 


Now for such ranks o? = 7; (V’—1), therefore 


_ 127 {S (a—y)P (viii) 
NW? (VP) (ache ee : 


Dr Spearman has introduced a quantity R which he terms a “correlational 
coefficient*” and which he defines without any special justification by: 


al 


2st) 
R= iN? 1) bv aBeosten bo tee (ix) 
We should thus have: 
a (N*—1) , 
l-r= SSE (LD) aces 5 ents oe ee ce (x), 


which would give approximately: r=2R—R?. 


* Journal of Psychology, Vol. 11. p. 96. 
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This is, of course, not true, for the distribution of ranks is not normal; the exact 
formula will be given later; but it suffices to indicate that the actual distribution 
assumed for and y will much influence the relation between rand &. Dr Spearman 
from trial gives the empirical formula 


r= sin (4 R) 28 Syst ok ee OU REE CES (x1), 


which is also incorrect. But the above relation shows that we are not @ priori 
compelled to suppose that 7+ merely changes its sign, not its numerical value 
when Ff changes sign. 


(3) On the Correlation of Grades. A method of representing frequency has 
been introduced by Francis Galton in which the extent of variation of a character is 
expressed by the position of the individual bearing this character in the population. 
This method was originally spoken of as that of percentiles but more recently as that 
of grades. A fundamental feature of the method is that the grade is looked upon as 
an index to the variate, it is not considered as in itself significant, or treated as an 
independent character of the individual. In order, however, to pass from the grade 
to the variate it is absolutely necessary to make some hypothesis as to the nature of 
the distribution. The hypothesis hitherto made is that the frequency follows, at 
least fairly closely, the normal or Gaussian law. On this assumption, tables of the 
probability integral enable us to pass at once from the grade to the magnitude of the 
variate, and vice versd. Quite recently, however, Dr Spearman has proposed that 
rank in a population for any variate should be considered as in itself the quantitative 
measure of the character, and he proceeds to correlate ranks as if they were quanti- 
tative measures of character, without any reference to the true value of the variate. 
This seems to me a retrograde step; hitherto we have dealt with grade or rank 
(I will distinguish between them presently) as an index to the variate, and to make 
rank into a unit itself cannot fail, I believe, to lead to grave misconception. Between 
mediocrities the unit of rank treated as a measure of a variate is practically zero, 
between extreme individuals, it is very large indeed. To state that two individuals 
differ by m ranks carries no meaning at all unless we add, (i) the size of the population 
dealt with, (ii) the position in the population of one or both individuals, and (iii) the 
nature of the frequency distribution which governs the population. I cannot therefore 
look upon the correlation of ranks as conveying any real idea of the correlation of 
variates, unless we have a means of passing from the correlation of ranks to the value 
of the correlation of the variates, i.e. the correlation of ranks can only be treated as a 
step subsidiary to determining the true variate correlation. 


The correlation between variates can be made to change widely by preserving 
the same system of ranks, but by altering the nature of the frequency distribution. 
Thus consider the system : 
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Variates Ranks 


The correlation of variates is perfect and the correlation of ranks is also perfect. But 
we may also have: 


Variates Ranks 


y a FN 2 OL OR re 2 1 2 3 4 


The correlation of variates is now ‘72, but the correlation of ranks remains perfect 
and would indicate nothing of this great difference. I think that it is safe to assert 
that until some assumption is made, at least as to the approximate nature of the 
distribution, we cannot hope to avoid misconceptions if we use the method of ranks 
without reference to the rank as index of the variate. 

In such a case there can hardly be a doubt that the best method is first to 
consider to what results normal distribution will lead us, and secondly if the formulae 
found turn out to be of a simple character to adopt these as the basis by definition of 
the variate correlation constant as found from a method of ranks. This will be 
the course adopted in the present memoir. 


(4) Let there be a population of N members and let these be under investigation 
for two correlated characters, means ™,, m,, standard deviations o,, o, correlation 7. 
I shall suppose normality of distribution. Let m,+a, m,+y be the deviations of the 
two characters in any individual. Then I term: 


if Eee geet 
payee [e or dx 
2rgith ios LSU O Raia (xii) 
ving od (she 
q,=o a eC Og ‘ 
: i J/2re, 0 4 


the x- and y-grades of the variates for the individual. It will be obvious that g, and 
g, are mathematical functions of the variates and that accordingly the correlation 
between them determines that between x and y, or vice versd. 

Obviously g, and g, can be found from tables of the probability integral as soon as 
x and y, the deviates, are known. 

I term rank the actual position in order of an individual with regard to any 
variate in a given series obtained by measurement or observation. If », be the 
‘rank’ of an individual for a given character this signifies that in the observed 
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population there are v,—4 individuals with character greater than x. If therefore we 
were to identify this with the grade we should have 


or g, would always differ from a whole number by ‘5. This, of course, it does not, 
and the whole problem of working with ranks really centres on the degree of 
approximation which is made when we proceed from ranks to grades by the relation 
(xii). A grade determined from a rank and not from a variate we may term a 
spurious grade; actually the real grade often differs by several units from the spurious 
grade, and the practical problem is: To what extent does this vitiate the use of ranks 
as a subsidiary stage to the determination of variate-correlation ? 

I shall first proceed to find the mean and standard deviation of a true grade; 
(xii) shows us at once that g,=9,=4N is the mean value of the grade. 

The frequency of a given variate lying between # and «+ dx 


N 
. J 2, 


But the frequency of the variate must also be the frequency of its grade, or: 


yee 
e *ordx=dg,. 


+a n Aap N 
NG Se =9,)’ dq, =$ | (a ss h:) i 
"2.N* NN? 
i 
Hence we have: CS St ay Se Sena rere (xiv). 


Now whereas our grades are a continuous series, the spurious grades or ranks are 
discontinuous and at intervals h=1. (xiii) shows us at once that 
v,=),=9,+4=4(N+1). 
Further C= 0,7 +h’, 
the latter corresponding to the Sheppard’s correction by which we pass from raw to 
adjusted moments. 
Thus we have : eee | 


Vy 


5, =9j=4 (N +1) | Mane He, Su eealais ces cacn ses (xv), 


(xv) must be used whenever we are dealing with ranks or spurious grades. 
Writing 7,=9,—g, and 1,=g,—g,, I now turn to the determination of the product 
moment of the grades. Let us put: 


1 2 ; 2 

1 1 Ly f (c = Mey) s) 

Z= e (1-7?) \o? oo, og 
2700, v1 -—7 


E00 £2 sis 
then : Pon=| | iuzdaedy 


gives the product moment of the grades. 
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Differentiate p,,,, with regard to r which is not contained in either 7, or 7,; we 


Wo IS iE isis G decay. 
But I have elsewhere* shown that : 


dz dz 


have: 


7 Tp 8 Tidy iiici juan acai (xvii) 
Accordingly : (Pow. = g0, | = | we 1% a dady. 


Integrating twice by parts and noting that the part between limits vanishes in 


teem o.o,| ee di, a Gea ae 


Substituting for di,/dx and di,/dy and writing x=a'o, y=y'c,, we find : 


EPo.o, _ N® ol tag ~ 911 —72) (2-1?) a? — Qrar’y! + (2 —1%) y? as, 
dr ae lee | ea =| sane 


2. bie - ot Oe 
ea wee) = r  WMm4—r- 
1—?7 (l—9r’) 


Now if p,, be the correlation of grades, we have : 


both cases, we have: 


1) —@ 


dl Led 
Pat Ng, On Nee eae 
Thus remembering (xiv) 
dpi 8) thee 
dr /4—7*’ 
or, Pw a sin7'47 + constant. 


Now p, and 7 must vanish together, hence the constant is zero. Accordingly we 
have: 


This remarkably simple formula enables us to determine the value of the true 
variate correlation from a correlation of grades on the assumption of the normal law ; 
or if grades may be replaced by ranks, a knowledge of the correlation of ranks will 
give us the correlation of the actual variates behind the order exhibited in the 
ranking. The important idea embodied in the above formula is the basis of the 
present memoir, and is as far as I am aware wholly new. 


* Phil. Trans. A. Vol. 195, p. 25. 
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It remains for us to consider methods of finding the rank or grade correlation and 
the probable error of such methods. 


(5) A convenient method of finding the grade correlation is that of formula (i), 


p-. 4, we have at once: 
= Carga 5 (9: 9.) NV 


° 


20,0, 
O me 2 
Or, Pu=l— pase 2) eo Weed Wg AM eco ee es aA (xix), 
if we use true grades, 
6S (», — v,) 
but: Te Nd) SR be Be BR 0a Oe RO co Sr eee Gon); 


if we use ranks », and »,. 


If we use ranks the discovery of S(v,—v,)’ or the sum of the squares of the 
differences of ranks forms a very easy process of determining p,, due regard being 
paid to certain points to be dealt with in the illustrations below. Then (xviii) will 
give the variate correlation. 

The probable error of p, and of r found in this way will be given in another 
section. 

Since the determination of p, by (xx) is algebraically identical with finding p, by 
the product moment, and such product moment gives the least probable error in the 
determination of a correlation coefficient, there must be some fallacy in a statement 
which has been propounded among the psychologists that a difference method of 
determining the correlation will give p, with about 2 of the probable error of the 
product moment method. This fallacy will be considered later. 

Meanwhile it is of interest to show that the probable error* of 


Pu = {S (v, v,)/ oo Vg} / (o,, o,,) 


P.E so SE at ee 2+, Px +65 Put --+) 
es Road 2 Pre 4 Pre 6 Pre eo) 


is of the form : 


where ¢,, ¢,, ¢,,--. are undetermined constants. Or, the probable error of p,, for p,=0, 
or for uncorrelated ranks is 
67 449//n—1, 
i.e. 18 absolutely identical with probable error of a coefficient of correlation of any two 
uncorrelated variables, and is not as asserted much smaller. 
Since for ranks o,, and o,, are constant, we havet to find the value of 


oie {Pe (nt yt, 


v, and v, being independent, in order to reach the squared standard deviation of 
Pauior p=. 


* nis here put for V as more convenient for the algebraic work which follows 
t I owe the following proof to the kindness of my friend “Student.” 


he KARL PEARSON 


wee Ped (Leola ce ata 


There being no correlation, n! arrangements of this product occur with equal 


n+1\* n+1\4 
se ! 
= ( 5 )=( 5 yin. 


Next any 4», occurs in (n— 1)! of the arrangements, for if v, be paired with »,, 


frequency. Hence 


the remaining »—1 pairs may be arranged in (n—1)! ways. Thus 


542 (et) S (nv fn }=2(n—1)! (nt) Er 
=2(n-1)! (“Et 2008 (v,) =2(n—1)! Ga yt Gs Nn 
=2(n)! (ey. 


Further : > (Se) => = {S (v2v2) +28 (yr, %'r)}, 


where »,', v,’ are different from v,, 1. 
Now vv occurs in (n—1)! arrangements ; hence 


> (Si) = Naa, S (v2) 3 (v2) = (n—1) iu (n+ 1) (2n+ “ly 


Vv nv 6 


=(n —1)! a pe ae 
6 
Next v,v,y,/v,/ occurs in (n — 2)! arrangements. 


) hea Babs, 5 ! 
Thus: S (= (vn Ys ) = fu : aN = (v.71) ) 


2 


= Vee (yp yy (v inh», & (uleee eee 


where v/ and »,’ may now take all values. 


Thus: 


» [PS esnita)) OE Eee) ® 3 (n(n) 21) (oon, ttn) +8 ins) | 


_(n —2)! {(" (n+ sy bes (" (1+ =a) n(n+ hes 1) % (" (n+ 1) (2n+ 0) 


wo \ 2 2 6 


Bes ! 
= At -(n +1) {9n? + 8n?— 8n — 4}. 
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Collecting the various parts we find : 


»_(n—1)! (n+1) (2n4+1) (n—1)! : : 
a 36 + aq (n+ 1)? (9n* + 8n?— 8n — 4) 
na +1\4 n+1 
—2n! : a CaaS 
ant (E) tnt (BSP), 
! 2 Es 
or, after reducing, eee (m+ 1) (n=1) 
144 
Therefore the mean value of u? is asc a ie es ry 


Now the probable error of p,,, for uncorrelated ranks : 
= *67449u/o,, CO, 


(n+1)Vn=1 jn?=1 


= 67449 19 12 


Sev VN OST ere heey cae sorore rere (xxl). 


It thus follows that if the value of p,, be not two or three times the expression 
(xxi), there is no significant correlation of ranks, and therefore no significant corre- 


lation of the corresponding variates. 


(6) On the Difference Method of finding the Correlation of Grades. 

Exactly as in the first section of this paper we may seek the correlation of grades 
by means of the sum S(g,—g,) of all their positive differences. This is slightly 
shorter than finding S(g,—g,), but only very slightly so, and it may be doubted 
whether the increased rapidity of working at all compensates for the decreased 
accuracy of the process. Still the result is interesting and throws considerable light 


on one or two allied points. 
Let G=S(9,—g), where the sum S is for all #-grades which are greater than 


corresponding y-grades. 
Let us put x=0,2’, y=o.y’, and write 


v 
iy -| e-2” dy, 
0 


I 1 ie, 
2a J1—r oo, 


Then: G= Sis mf G a —Jy) zdyd« 


ys) Zdy' da’. 
~ (2m) ee [Ge ives 


dy =) 


iS KARL PEARSON 
== Se ail. fot 7 ie 3) daft 
=Gatl_. 2 —jy) nan (Gp) ava! 
“i= | - {| ie hie seh oe z q y\ ae 


“(any oni. ls e-3y” clan d) dyfde 


; then after rearranging : 


Ca N? + oa ie ple Ae ” 2(14+7r) nl 
dr crac). J (0 (2 a ae sa) 


Sent a ‘+ Pe RASTY y" 
xe *er\Y 340%) T= B47)" J dy"dal. 


The order of integration can now: be transposed and if X be written for 


It 


Put y’=a'-y 


the limits of X will also be —o to +o. Thus: 


AG =t7q- elie 2(1+r) le Sy oe 
a VOT 1—r)X v itr” dXdy'’. 
dr Soria 1—r')3 {oe Nia ee ea Str 7 J 


But if ¢ have any value : 


1.) 


4- 00 + 0 peed) 
| Cut AGA = 0,geeod | e 40 1X = On. 


Hence: 


oe N’? (1+7) l+r 
dr (1—7°)2 ( ee at 3+97r 


[.v ea enon cy!" 
Vee! 2(1+7)? (1-1) (3+7) 
Or (l—r)? (8+,7)2 2 
peas 1 eg 1 
on Ji—r)(8+r) %V/4—(14+7r) 


Hence integrating : 


2 
G= Soret eae cos* be : 
20r Wh 


But when r=1, G must be zero; therefore the constant is zero, or inverting : 


= 2160S an 1. 
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Or, finally*: r=2 cos te (AEH) eae eet (xxii). 


This gives us the correlation of two variates from the corresponding grades by a 
difference method. 


If + be zero, we must have 27 Ae equal to 60°=7/8, or S(g,—g,)=4N° 


when there is no correlation of variates. This is easily proved directed, for in this 


case : 
+o [2' Bes 
G = 5 (Jz — Jy) = om [(o-a) = e732 +Y*) dy'da! 
ee he dg 
- rl, 0 (Gx Jy) Dy 6 Ix 


For ranks the corresponding expression to be used is ¢ (V*— 1), or we have: 


So) 


r= 2 cos 27 | 
(Set 


As before the truth of (xxii) depends on the approximation to normal correlation. 

If we combine (xx), (xviii) and (xxiii) we have the relation between S (v,—v,)’ and 
S'(v,—v,) which holds in the case of normal correlation. 

Writing R=1—S(v,—»,)/4(N*-1), we have: 


7 =2 sin : Pa 2 cos a Clee aI has Wise oe ane. ore (xxiv). 


Dr Spearman givest+ the relation : 


Da e1n : R) BUN MR EPR ee orca cs co's (xxv) 


(he neither connects p,, nor A, with 7) as apparently an empirical relationship and 
speaks of it as ‘‘all that could be desired.” It is clearly incompatible with normal 


* The relationship of (xxii) to (vii) is easily seen if we expand the cosine as far as the square of the 
angle. We have 
4r* S (91-92)? + 1S (9, —92)}? 
pet Se) SS Sey peek ee 
(vii) would have given us 1 instead of the factor 1:°0472. Thus when there is high correlation, or S (g; —g) 
is small, we see that the difference method with grades leads us to nearly the same result, as the assumption 
that the grades themselves form a normal distribution. This suggests that Spearman would have got 


N? Y as : 
much better results for his “footrule” for measuring correlation had he taken R=1-3 : (* es v2) ) 
oe 


N?-1 
’ ae 2 
instead of 1 — Aen v2) . for this value, i.e. 1 — (Fe) in his notation, would have been almost the true 


variate correlation r. 
1 Journal of Psychology, Vol. 11. p. 102, 
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correlation, which at any rate is a fairly good guide for general relations of this sort 
in the theory of frequency. Table I. gives the values of r and # for each ‘05 for py. 
Table II. gives the values of 7 and p, for each ‘05 of R, and in the last column the 
value of p, which would arise if (xxv) were correct. 


TaBLE I. Correlation of Variates from Mean Square 
Difference of Grades. 


| P12 4 h Piz y R 
00 000 000 || -50 518 323 
05 052 029 || -55 568 361 
| -10 105 059 60 618 400 
| 2 157 089 65 668 442 
20 209 120 || -70 717 486 
25 | -261 152 | -75 ‘765 533 
30 313 184 80 813 584 
35 364 217 85 861 644 
40 416 251 90 908 709 
45 467 | -286 95 |. -954 796 
| 50 |. 518-.| +323 1:00 | 1-000 | 1-000 
| | | 


TaBLE II. Correlation of Variates from Difference of Grades. 


| Bond r Pye (sx0)) 4] R r Pre (xxv) 
| 
00 | 000 -000 000 || 50 732 ‘716 ‘707 
05 | 089 085 078" |) 55 782 767 .| °760 
‘10 ‘176 168 156 || -60 “827 “814 ‘809 
‘15 259 “9480-3330 Ie 6p ‘867 “856 853 
20 | -338 324 309 ||” -70 902 “894 ‘891 
‘25 bh 4 398 383 ‘75 932 926 924 
‘30 | -486 469 454 ‘80 956 ‘952 ‘951 
35 | +554 536 ‘522 || 85 ‘975 ‘975 |, +979 
| -40°| -618 600 587 90 989 988 “Y88 
45 | +677 660 649 95 ‘997 ‘997 ‘997 
rma: gee ( 32 ‘716 707 1:00 1-000 1:000 | 1-000 


Now these Tables bring out several interesting facts. The first is the remarkable 
closeness between the correlation of the grades and the true correlation of the 
variates, if we suppose the system normal. The maximum difference as shown by. 
Table I. is 018 and actually the maximum of r—p, occurs when p,,=°5756 and is 
then ‘0180. Thus, the difference will often be of the order of the probable error. 
The formula (xviii) is so simple, that we can always deduce the variate correlation at 
once from the grade correlation. I propose to define ” as given by (xviii) as the 
grade-variate correlation. Whenever the system is normal, or approximately normal, 
this will agree with the true variate correlation closely. Next Table I. shows us that 
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equal differences of p, give almost equal differences of 7, Le. the differences only 
range from ‘052 to ‘046 of r for differences of 050 of p,. On the other hand the 
differences of 7 for equal differences -050 of R vary from ‘089 to ‘0038, or second 
differences become of importance. Clearly for high values of &, + will be found much 
more closely than for low values. 

If EH,’ be the error in 7 due to an error #,, in py, and H,” be the error due to an 
error H, in R, we have: 


TT 7 
E = 3 C08 & Pu X Ee, 


By’ =" sin 7 (1-R)x Ep 


if we use differentials. For the special case of p»=R=0, we have seen that the 
probable error of p,,="67449/Vn—1; it will be seen later that the probable error of 
R is :4266 IN een | nearly, and if H, be the probable error of *7=0, as found in the 
ordinary product moment way, we have: 
py pn. (6745 6745 Dar ,/3 *4266 
tend Uaisslac tes 8 n=l 
6745 "7063 7738 
peak au er Aten ott 
Thus we see that, contrary to what has been asserted, the accuracy of the new 
methods—when they are measured by the determination of the true correlation— 
are less than the old product moment method. In particular it requires about 
30 per cent. more observations by the R method to obtain 7 with the same degree of 
certainty, when 7=0. 

At present we do not know the & factor term in Hz, when Ff differs from zero, 
and accordingly cannot test H,, HL,’ and E,” at other values of A or p,, but I have 
little doubt of the general. truth of the result that H, is at all values as well as for 
r=0, sensibly less than #,’ and still less than £,”. 


(7) Remarks on the Probable Error of Rf. 


The probable error of a quantity in which the limits of the summation vary as we 
make random variations in the constants is always a troublesome matter, and I have 
not yet succeeded in evaluating the probable error of S(g,—g.) when g,>g, for any 
value of 7. 

Spearman has investigated the probable error of the corresponding expression for 
ranks, S(v,—»,), when there is no correlation between the ranks. He finds that for 
n observations the probable error of & may be taken as ‘43/./n, and from this result 
he has drawn rather sweeping conclusions as that: “twenty cases treated in one of 
the ways described furnish as much certitude as 180 in another more usual way” ; or 


that : ‘‘a probable error may at present be admitted without much hesitation up to 
3—2 
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0:05; so that by adopting the method of calculation recommended, two to three 
dozen subjects would be sufficient for most purposes*.” Now these statements seem 
to me not without grave danger, and accordingly it is well to see where the error has 
crept in. 

Spearman gives the value -4266/,/n, but it should be “4266 [vi n—17t, and accord- 
ingly since we have seen that the probable error of p, for py=0, is 6745 / /n—1, the 
probable error of & would only be about 3 of the probable error of p,,, and upon this 
Spearman’s statements are based. 

Now the probable error of any quantity is conventionally ‘67449 x standard 
deviation //n — 1, and accordingly for the same number of observations the probable 
error is less when the standard deviation is less. But there would be no meaning in 
asserting that the mean of 20 metacarpal bones could be found with much more exacti- 
tude than the mean of 20 humeri, because the latter being a larger bone had a greater 
variability. We must either measure the same quantity by different processes, or 
else be at any rate certain that our quantities are alike in character and function 
before we compare their probable errors. The probable error of 3a is certainly less 
than that of «. Now p, is a true correlation and ranges from +1 through 0 to —1 
with a symmetrical distribution about 0, if we take the case of a random distribution 
of ranks. The quantity R presents nothing of this nature at all; random distribution 
of ranks does not give a symmetrical distribution for R, its range is not from +1 
to —1, and there are certain values it can never take. In order to bring out these 
points I take the following table for & negative. 


Taste Ill. Negative Correlation of Variates from Difference of Grades. 


R | uP Piz 

N.B. It will be observed that when # is 
=—05" ). —= 027 1 085 negative, the true variate correlation is almost 
~10 | — ‘187 | — ‘T78 | double the magnitude of #, while if 2 be 
ze a | Py ie ai ae positive (Table II.) 7 is larger than # but not 
— +25 | — +482 ZAG to this exaggerated extent. It will be clear 
—'30 | — ‘584 — +566 that no estimate of the real correlation can be 
— °35 Is ‘687 = 070 based on &, if it does not allow for this ex- 
- re ies ee é a aggeration. 
—50 | —1:000 — 1-000 


* American Journal of Psychology, Vol. xv. pp. 100, 101. For the proof of the probable error cited 
see: British Journal of Psychology, Vol. 11. pp. 105-8. 
5 : ‘4266 ne? — 3°5 : =o 
+ Spearman’s result at bottom of p, 108 may be written Cea ee or neglecting terms in a 
n— ze : 


i| iF se 
not 7 he does, this gives *4266/ Vn-—1 as we should anticipate. 
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Thus we see that while 7 and p,, run from —0°0 to —1°0, R only runs from — 0:0 
to —°'50. 

In order to obtain his probable error for & Spearman takes every random arrange- 
ment of ranks », and »y, for which », is greater than v,. He has neglected to observe 
that when he does this his & will become negative, but that it will not range 
from —1:0 to +1°0. For example, I take the following system of ranks for (2m+1) 
individuals : 


| | | | 
ie | 1 ge ts JS als | home? 
| | 
Va 2m+1 2m | 2m— 1 2m — 2 | 3 | : 1 | 
| 
This gives : S(v,—v,) = 2 {(2m)’ + (2m — 2) +(2m—4) 4 ... +27} 
= 8m (m-+1) (2m+1)/6, 
AISI _ 8m (m+1)(2m+1) Es ahi 
7 Pa N(N1)  (@m+1)(2m+ 1-1)” 
But S(vy,—v,)=2+44+...+(2m—4) + (2m—2)+2m=m (m+ 1). 
Ve eal. (04 Pi 6mim+1) ip 
Therefore : Jane 2 ayaa IC eqn Dee nc te csras (xxvii). 


Accordingly when the correlation is negative and perfect, the number of observations 
being odd, # will never take the value —1, but no greater value than —°5; whereas 
if we reckon our second ranks in the negative direction # will equal +1. 

Here the Spearman formula (xxv) leads to the absurd result p,.= —1//2, instead 
of —1. On the other hand my formulae (xxiv) for p,=—1 and R=—°‘5 give 
absolutely the correct value += —1 for the variate correlation. 

Again take N even =2m and consider the system : 


| 


| | | 

| i — 1 2 3 | 4 | an 

| | | 

t= 2m 2m—1 2m —2 2m—3 | 1 
We find: S (v,—v,)? =2 {((2m —1)P' + (2m — 3) + (Qm—5)P+...4+ VI} 
== (4m —1), 
Waa 
6 (4m? = 1) 

and this gives Py = | ->———— = - 1. 


2m (4m? — 1) 
Again : S(v,—v,)=1+3+54+...+(2m—5)+(2m—3)+(2m—1) 


=, 
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a Gn > ane Lae apts cae 
Hence R=1- 4m? — Hl = nines =—'5 {1 Tesi Sie veliciers (xxviii). 
For: N=4, R=-'600; N=10, R= —°'515, 


N=20;' R= ="504- N=100, R= —-500, 


Or, again, the limit’ —‘5 is rapidly reached as the number of observations 
increases. In fact solely for the simple case of two observations is it possible for R 
to reach —1. 

If it be objected to (xxiv) that it would now give for values of R, greater than 
— ‘5 values of the variate correlation greater than —1 (= —1'09 at a maximum for 
N= 4), this is overlooking the point that (xxiv) is deduced from (xxii) by replacing 
true grades by spurious grades or ranks, and that if we retain (xxii) then 

S(1-9)/N*=7,=4, 
and r= —1 as it should do. 

We have now reached I think the basis of Spearman’s apparent paradox. While 
the variation of the true rank correlation p lies between +1 and —1 and has 
67449//N—1 for its probable error, the value of R only ranges between +1 
and —‘5, and may well have a less value for its probable error. 

Now Spearman tells us that large negative values of his R should be avoided*. 
There is no necessity whatever for avoiding them if we are seeking the variate 
correlation by the formula given in this memoir. But if we are seeking the probable 
_ error of a zero quantity, which may vary on either side of zero (and in this case the 
variation is not symmetrical about zero), we cannot neglect the distribution of random 
variations below zero. If Spearman wishes his R to be considered always positive, 
then he ought to have found the probable error on the assumption that S (»,—v,) 
should never be greater than }(N?—1). He has taken a quantity which ranges from 
+1 to —‘5 and compared its random variations with one which ranges from +1 
to —1 for the same frequency. If he had restricted his attention to variations of R 
between 0 and +1 and of Pw» between 0 and +1 he would not have reached the same 


conclusion. 

But there is a further very serious indictment to be made against Spearman’s R, 
For values of N fairly small, which are those for which he proposes to use it, R 
retains a constant value for wide variations in Pu. We can show this on an 
exaggerated scale by writing down the possible values for Spearman’s # and the true 
rank correlation for 4 individuals taken with random ranks. See Table on Des: 

A little consideration will show to what much better results Px leads us than R. 
f in fact remains constant and = —-2 while Pw» passes through the values 0, —:2, —:4 
and —‘6; or 7 can take values from 0 to — ‘62, while its value as found from R 


* Loc. cit., footnote, p. 96. 
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remains —°38. This simple illustration of how the real rank correlation varies 
widely while Spearman’s coefficient R remains constant shows how unsuitable the 
latter is, when we have to deal with small series. 

Another point worth noting is that, if we take the positive values of the 
correlation only, the mean value of # is ‘3818, while the mean value of the corre- 
sponding p,.'s 1s 5454 ; the former has a standard deviation of :2622 and the latter 
of ‘2573, showing that we are not justified in asserting that # has a smaller probable 
error than p, when we take comparable quantities. 

Spearman appears to have an idea that # is really a coefficient ponperaie with 
Pn, and he attempts to get over some difficulties which have arisen, by telling us to 
reverse one series of ranks when R& comes out negative. But reversing the ranks 
does not aid us to the right result. Thus if the ranks in the 12th and 13th column 
of v, above be reversed, we find that A still remains negative and of the same 
magnitude —*2. In fact it is easy to write down a system of ranks which give a 
negative f, and which on reversal give a negative # six or seven times as big. The 
fact is simply that & is not a symmetrical function of p, and reversal of ranks does 
not necessarily reverse p, in sign. 

We see accordingly (i) that the total range of R is only about % that of p,,, and 
that if we make the range the same by any attempt to reverse ranks, the Spearman 
method of calculating the probable error for R=0 is erroneous. (ii) That the 
distribution of R for random rankings has a median which differs from zero, is very 
skew, and is in no ways comparable with that for py. 

A point to be borne in mind most carefully is that for a given value of R, p, the 
true rank correlation may take a great variety of values. It is only when (i) the 
number of observations is fairly considerable, and (ii) we assume some distribution of 
associated grades such as that of normal correlation, that we are able to assert that 
the value of A will fix p,,, but such a relationship as that connecting p,,, R and the 
variate correlation 7 can only be fixed, as in this memoir, by the appeal to despised 
mathematical analysis. 

Thus the advantages claimed by Spearman for A, namely: (a) that it frees the 
discussion from the complexities of mathematical analysis, and (b) that it gives a less 
probable error than more usual ways of approaching the subject, are seen to be 
illusory. 

The dithculty that p,, may take a whole series of values for a single value of F is 
only surmounted if we define the character of our frequency distribution, and there 
is no doubt that we shall obtain a first approximation by defining it as. normal. 
Secondly, we cannot reverse ranks with the effect Spearman proposes, and if we could 
his probable error of & for R=0 would be erroneous. Lastly, if we do not reverse 
ranks, then the probable error of one and the same quantity, the variate correlation, is 
considerably greater—for the only case yet worked out—i.e. R=0, when found by 
Spearman’s method, than when found by the well-known method of squares of 
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differences, and still less than if found by the product of the variates directly. The 
squares of the differences of ranks can be taken so directly and quickly from a table 
of squares, that it does not seem to me that the slight rapidity gained in using positive 
differences of ranks is of any weight against its increased inaccuracy for small series, 
where indeed it is likely to be chiefly used. 

Further no two rank correlations are in the least reliable or comparable unless we 
assume that the frequency distributions are of the same general character (see p. 9), 
and this general character will, till further advance be made in the theory of skew- 
correlation, be undoubtedly that provided by the hypothesis of normal distribution. 
On this assumption Spearman’s suggestion of correlation of ranks becomes valid, but 
not as he supposes as a Ding an sich, but only as a means of passing at any rate to 
an approximation to the variate correlation, and this in the case of quantities where 
it is easier to rank individuals than to measure their attributes accurately. 

For the grounds stated in this section, I propose to use as a rule p,, and not & to 
findv. For this reason I have spent my energies in finding the probable error of p, 
instead of seeking that of R. 


(8) On the Probable Error of the Correlation of Grades. 
The following investigation is admittedly lengthy, but I have not seen my way to 
shorten it, and the main point is to reach by some road the expression for the probable 


error. The most general expression for the probable error of a correlation whatever 
be the distribution is to be found from °67449>, where* : 


seu” {Pal Px Pot Pan (Pe Pw | 


cz N | P " 2 P 20 P 02 4 P Py 4 P - Pu Pr Pu Px» J 
and ITI esas sce ccersenee oss (xxix). 


Now in our case « and y are to be the grades g, and g, and 7 is to be 


Pu = Pu/ Mi Pe» Fay 
which we will write for this investigation p. 
We have at once: 


+0 -, 
NX Das =| (4.-9:)” 


= b,2 2 
e ION dy, 


—» /270; 
iene (a a 
and : n=h+ 5 | e “o' da, 
1 0 
N -3- 
dg, === e “dz. 
3 J 2, 


* Mathematical Contributions to the Theory of Evolution, x1v. “ Draper’s Research Memoirs,” Biometric 
Series 11, p. 20. I have omitted certain terms which cancel. 
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- 
Thus: Pwo =H | (9,.—9;)”" dg,=0, if m be odd, 
= ; (4N)”, if m be even. 
Thus Po=PuxsoNs and Po=Pn=pyN", as before ............ (xxx). 
We can now write (xxix) in the form: 
9 us 22 9 De 2 5 
3) = 7 {25 ( +p!) —2p Pe + siop'h Mien satee Ne (xxxi), 


assuming as we shall show in the sequel (p. 30) that p,=p,. Accordingly we have 
now to find p,, and p,,. 


: Ih Z = 
First to find: Pu= Hy {S$ (9,-9:)° (92-92) 
or if we use the notation of p. 11, 
bead free Wfcrcel oe ae 
| Pa= HN | a Un COMO ee tiear ee «theses are oe (xxx). 


Now I have not succeeded in integrating this expression, although I have spent 
much time over it, but I have expanded it in powers of the variate correlation 1. 


1 ra 


1 , ed i 
If ee gS Tet aero ie 


and j,, and j,, are the same as on p. 15, we can write: 


Ni (tee 2 ee 
pa= 5 | limes Jy» Uda dy’. 


aeeney 


' y Bern 
But* U=S— UnWne * ene 
N3 y 
3 Pam 53'S (73%) 
+0 2 + 00 
aes eI Sag sk it: as 
where (<= | je odae= | Fy e BY Wi OY Was attioaee (xxxiii). 
=@ —a 


If n be odd, v, and w, have odd powers and q, =0, hence p,, contains only even 
powers of 1. 


fares 48 
First : CoS: | jee 2” vada, 
where we may drop the dashes from the letters now, and 1,=1. Therefore : 
(ALAC) egen leee Pee Pee XXXIV 
a-| Ja dn OU =3 BS a eyo ( ) 


* Pearson: Phil. Trans. A, Vol. 195, p. 3. 
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Now” : iC Se = ohms!) and ai 
+a +0 
Hence: In = -| jo d(,_,e- 2” )=2 | Fee Un dx 
PAM he ae 4 
==| Ine *duy= -5 | v,(e 2” — 2xe 
i n 
Pe EPs <4 4 
= — | Une 3* ae+* | due LU, AX. 
is apts n co) 
aad dv, th 
whine Pag Vays s Te a : us 


yaa (ee 842 2 
= =| Une dat + Insst 2Gu- 


+a 
not hn | Ve 2” a= PRAY, a cts adsie toss: (xxxv) 
It now remains to find £,. 
+a _ 842 +@ ety 
eal Une 2 de= | {av,_,—(n—1)v,_,}e72" dx 
+0 +a 
ints me (ev, de—(n=1) | (Bes MS da 
= +0 ei +@ 
= e 3 *| ea Un, 2 bK — (n— | Un—s gone dx 
Pe: 872 
= -#(n-1) | Une *” d= —8(n—1)B,_, 
= (—2)?” (n—1)(n—8) (n—5) ...... 1 x B,, being of course even. 
+0 sedge oe 
But B= | ve 2” da=/ 8m. 
Thus we have the reduction formula: 
Onsa + 42Gn =(—%)2” (n—1) (n—8) (0-5)... Lx cs ere (xxxvi) 


* loc. cit. p. 5. T loc, cit. p. 4. 
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We can now rapidly calculate the q’s. 


ke x ,/ 2 __5 [20 
NWR ten EE RS ie) e 
ole Ze _ 166 /20 2352 [on 
29 8oN (8 AON el eae aN) ie 


This is probably more than sufficient for most practical purposes. Evaluating the 
coefficients numerically we have from (xxxiii) : 


ate 1 + 607,927 17 + °140,723974 + 036,77587° 


20 
-++010,258797 -45002,99339" =o. ee ee (xxXxvii). 


To test the accuracy of this result—obviously correct for r=0—consider r=1. 


We have: 


( Desf Dis) tL ORB BO: Acre meee (Xxxviii). 
But in the case the variate correlation surface becomes a ridge and 7, =2,, or: 
ey tre be ee 
st =e 1, ———e “dz 
(Pa) WN ie 5 J sa, 
+ N/2 : N4 
Ne fs? Ne 80° 
5 144 q 
Thus (2s) =a TB.y casas reece ae eae ee (Sxxix 
20 /r=1 


The difference between (xxxvili) and (xxxix) is only -001,3212 or about -07 per 
cent. Thus even if we omit the term in r”, we shall -be less than ‘2 per cent in error 
in this extreme case, when the probable error itself is zero; and for lesser values of r, 
where the probable error is sensible, we shall not be as much as ‘01 per cent in error. 
This is amply sufficient for statistical purposes. I now take p, and find its value in 


a different manner. 
1 +a +o See , 
Pu=Hy | bee 1° 2,z2dady, 


ml 1 ty tg © dedy eet bed | ea: Tedy (U- 


This can be integrated twice by parts, and the part eS limits vanishes at 
each integration. Writing x=0,z', y=o,y' as before, we have: 


{Ps ee gC tle ak) ie ee ee 2 
=o. ee re 2’ da’ dy’. 


. . + 2 9 + . * ° 
* This is found at once from q, = 2f Jul Uae OE, OL aUs =- | jnde-™, since %_,=2. Thus 
A al.) 


= P aa] 
a= | eB da = 2a 7: 
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The integration with regard to y’ can now be completed and we find : 
Ap, _ 3N* 1 [e -3 ee de 
dr (J2ay V2—7 


—o 
i ey Le AS 
where Wee G2 5 Od 
0 


and we have dropped the dash on # as no longer of service. 
Write m=(4—7°)/(2—7°), and we must now find: 


amd Leas 
ine | AL UCTELG Jock OA ih 
Now 
dl + 1 
Eig? Wee ‘ I ey sue 2 
b.8 gM —tmuv 2\ eee a ees - —$(m+l1) 2? | 
Saale a. (% 93.) ep alk wee da, 
I 1 sc ae 1 2 
Se dle? (m+1) a 
Bm * m (m+ se (e ) 
IMEL 1 oma 2 J Qa 
or: ee a See R a eee oe eee —3(m+2)a dx= — : 
dion aman! .° se m(m+1)/m+2 
thus : iva (/mTI) = — Vom . 
dm (m+1)V(m+1)-1 
we a 1 ze 
Thus A J/mIl= constant — /Q4 cos? eet BODO DODD yO O OCG (xlii). 
To evaluate the constant* put m=1, and we have: 
constant = J,,_, +/2a cos? 4 
+n ee) Ty 
-| e384 8d +) 20 3 
tr/nf2 Sty , 
= aby = =177/,/2. 
loa FE NC a 
J 2a {a 1 
; fio B Ble? A) Cin oh ieg oe 
Or finally grant are rf 
= 32 SAE Oe eer ree xlii 
a sin (xliii) 
* Mr L. F. Richardson has shown me that if we put m=0, since the inverse cosine now vanishes: 
constant = Limit s=« of 27 | oe = e73 2 j2dx, which he has evaluated with the same result. 
-0~ s7To 


30 KARL PEARSON 


Returning now to (xl) we can replace 7m by its value in terms of r and write 


a [Dx eLOe ee ee ty 
dr pais ae mee: sin Gorin eee (xliv). 


This expression I have not succeeded in integrating. I have therefore expanded it 


in 7 and then integrated. ‘Since p,,=0 for y=0, we see the constant is zero after 
integration ; thus after some troublesome expansions I find : 


1 54 
oi a: aS {:339,83697 — -005,48207" — 003,6798r* — -001,18367'}......(xlv). 
20 


The value of p,, is clearly the same as p,, for nothing would be altered if « and y 
were interchanged from (xl) onwards. To test the accuracy of the result, suppose 
r=1. Then we have from the ‘ridge’ : 


eae aay] i e~ 27lov’ de and i,=%, 
J 20, 
+N/2 . f Nt 
=H a 
: Ps.) = 1°8, again 
or: ies 4s » ag: 


But (xlv) gives us : 


(2) = 1°8 x 1:00153, 
Po (pil 


that is a result at a maximum only ‘15 per cent. in error and correct enough for all 
statistical purposes. The next step is to determine the powers of r in terms of p, and 
substitute in the expressions just found for py/p,? and p,/p,’. I find: 


== 1 + 666,6667 p+ °108,3084p'+ °019,7955p' + 002,7683p%......(xlvi), 


20 
Pa 
To verify we note that for p=1, these give 1°7975 and 1°8005 instead of 1:8,— 
quite sufficiently close for the purpose in view. 
We now substitute in (xxxi) and find as far as p* that : 


and : ; = 1:947,1220p —-123,4135p* —‘019,4138p" — 003,8120p" ...... (xlvii). 


Sis na — 1827,577 3p' + “688,4687 p' +°112,7773p" + 020 2900p) (xlviii). 
I throw this, by dividing by (1—p°)’, into the form : 
= = ie a tk {1 +°086,2113p?+ 012, 9408p*+ 002 AT 57p'+-000,0822—') 
or, dropping ee decimals : 


‘= Fae bi hs 086° 4° 015 p= 00 2p" 2. vac. Same ee dene (xlix). 
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Thus we see that the distribution of grades being very far from normal, the 
probable error °67449%, of the correlation of grades exceeds the value ‘67449 (1 —p’)//N, 
which it would take on the hypothesis of normal correlation by a factor which can 
amount to about 10 per cent. at a maximum, but gives 0 per cent. excess when p=0, 
then agreeing with our previous result. 

I propose now to find the probable error in 7 as determined by grade methods in 
terms of 7. This involves expressing p and p’ in terms of 7; these are easily found 
from the known expansions for sin“'w and (sin™’a). We have: 


1+4p?=1+4°455,945397" + -037,99547* + °005,06617° + 000,81427°%, 
2p = 1°909,8593r + :079,57757° + 009,26507° + °001,33227". 


These must be used in (xxxi), which may be written in the form: 


pi +18) (1+4p')-2 jaa 8). 


Saat ts 
- =H (2 


Hence using (xxxvuli) and (xlv), we deduce after some troublesome multiplications: 


yay =yi{l — 1°666,55077° + °433,61307* + °161,83377° + °049,50427"} 


= = (1-7) {1 + °333,44937° + 100,5116r*+ °029,40767° + ‘007,80787°t. 


But since: r= 2 sing Ps 


or = 5 008 @ PX Op and 2 =—> ee 
Thus: 


3=5 zetia ae {1 + 083,44939? + -017,1493r' + -004,2797r* + 000,45597"}. 


Taking the oe root we have: 


2 = 1" 0472 + N aT +041 72467" + :007,70427* + -001,81847* + °000,12247*t, 


“IN 


or, for all practical purposes, the probable error of 7 found from the grade correlation 
is, . 


P.E. of r= "70683 --” {1 + -0427°-+ 0081+ 002M} acceseseseeees (1). 


IN 


Clearly for all values of 7, this is larger than the probable error of the correlation 
r found by the product moment method, i.e. °67449 ae . The maximum difference, 
as 7 approaches unity is 10 per cent. The value can always be found from (1) without 
any trouble. The completer value is singularly close to 


706338 1-7? : 
a hy, Mae aicnbratne Gaels cneee hi), 
JN {i—4r}3 o) 
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but no advantage is gained in calculation by using this form, as tables of powers of r 
up to the 6th exist*. 

We see therefore from this section that whatever be the value of r, then for 
normal frequency the probable error of 7 found by the product moment method is less 
than the value found by the correlation of grades. Further there is no reason for 
supposing that the probable error of 7 found from the difference of grades (f) is not 
greater than the probable error of r found from the product moment of orades. 

We accordingly conclude that the new methods are less accurate than the old. 
But they possess some advantages,—when ranks can be easily determined,—in 
rapidity of calculating, and there are undoubtedly cases where they can be used 
effectively. In saying this I must reassert that I do not believe there is any advantage 
in the knowledge of rank correlation in itself; I look upon it as a mere stage to the 
discovery of the variate correlation. For the comparability of rank correlations 
depends upon the sameness of type in the frequency distributions, and this assumption 
is the weak step in the method. Granted approximately normal distributions, then 
the variate correlation flows from the rank correlation, and the whole investigation 
gains a rich significance. | 

My remaining sections will be devoted to illustration of the new methods and 
their comparison with the old. 


(9) Illustration III. Correlation of National Debt and Population. 


The following table is based on data for the year 1900, and raises no pretence to 
exactness, or financial accuracy. It is merely illustrative. 


Tasie IV. Population and Indebtedness of Various States 1900. 


sig | Reread ee 
Russia 129.20 1097-0 1 2 =) 1 
United States 76°40 200-0 2 8 1G 36 
German Empiret 56°34 649-4 ee 4 —1 "ad 
Austria 47-01 2208 4 ff =) 3 
Japan 43°80 51:5 5 15 —10 100 
United Kingdom 41:60 705-0 6 a +3 9 
France 38°64 1242-1 a 1 +6 36 
Italy 32°10 500-0 8 5 +3 9 
Turkey 20-30 162:0 9 9 0 0 
Spain 18-10 385-0 10 6 +4 16 
Belgium 6°82 106-4 11 ti 0 0 
Roumania 5:50 58-0 12 14 = 4 
Sweden 5-14 18-6 ile ive —4 16 
Holland 5:10 95:6 14 12 se) 4 
Portugal 4-70 155-0 15 10 +5 25 
Argentine 4:50 86-4 16 13 +3 9 
Switzerland 3°30 3°6 17 20 —3 9 
Greece 2-40 28-0 18 16 +2 4 
Norway 2:20 Vey 19 18 +1 1 
Denmark 2-18 11:6 20 19 +1 1 
— = 30 = S (v,— V9) |290 = S (vy, — 1)? 


* See Biometrika, Vol. u. p. 474. + Imperial debt and sum of state debts. 
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Hence : N*—1=399, and p,»=1—6 x 290/(20 x 399) = °7820. 
Further : R=1-6 x 30/399 =°5489. 


These values are obtained in a few minutes, if the ranks have once been written 
down. If p, only be required, we need not write down the v,—v, column at all, the 
squares being placed down straight away from the rank columns. 


Now applying equations (xxiv) we determine : 
r="7962, found from p,, 
=°7810, found from R. 


The probable error of r found from p, as given by Equation (1) is 0596. Thus we 
conclude that 
r='80+°06, found from p,, 


='78+ > 063, found from R*. 
If we turn to the much more laborious method of moments, we find: 


Mean Population = 27°26 millions; Mean Debt =289°7 million £, 
S. D. Population =31°'74 millions; 8. D. Debt =357°9 million £. 


Now these results in themselves should be sufficient to warn us that both distri- 
butions are very far from normal; for the S. D.’s in both cases are greater than the 
means, and since in a normal distribution, we might easily have a deviation equal to 
the 8. D. we should on that hypothesis expect to get negative debts and negative 
populations. The distributions are therefore very skew, or in clubbing together great 
and small powers, we have introduced excessive heterogeneity, completely destroying 
any approach to normality+. If we work out the value of + by the product moment 


method, we find : 
r='68 +'08. 


We see at once that the rank method has so exaggerated the correlation that it 
has made the probable error of the less exact methods less than the probable error of 
the more exact method! The explanation of this lies simply in the fact that the 
system we are dealing with is not normal. If the ranks of two variables were those 
given in Table IV, and the distribution were normal, then the variate correlation 
would be °80 ; it actually takes the value ‘68, and this is a very good illustration of 
how much the nature of the distribution may affect a judgment from ranks. 


* The p. e. is of the form ees (1-7?) (1 + e7° + ¢,7* + ¢57°), the c’s being positive unknown constants, 


Jn 
and this is > 063, 
+ If we confine our attention to the seven “great powers,” Austria, France, Germany, Great Britain, 
Italy, Russia and the United States, we find p,.=— ‘143, R=—:125, giving r=—'15 and —*23 with a 
probable error of °3; this result again emphasises the heterogeneity of the material. 


5 
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Of course it is doubtful whether when-we are in ignorance of the character of the 
distribution we could say more than ma 
r='8+'1, found from p,, 
and r='7 +°1,. found by product-moment. / 

These might then be treated as identical for some purposes of inference. But the 
advantage of the longer product-moment method would be that it would have taught 
us that the correlation was non-Gaussian, and given us in the process the regression 
line. “This would probably more than compensate for its greater laboriousness. 


(10) Llustration IV. Correlation between mean Size of Litter in a Generation 
and mean Sex Ratio in the same Generation in the case of Mice, 


The following data are taken from a paper in Biometrika, Vol. v., p. 439. 


TABLE V, 
<S tle Mean siz ; | Sex Ratio | . 
- | = 
Ist 506 | 505 5 3 49 4° 
2nd 4:94 | -491 6 4 +2 4 
3rd 5-96 523 ] 2 —] 1 
4th 503 (ol) “549 2 1 +-1 1 
5th 553 | +462 3 6 a3 9 
6th Teas | 483 4 5 a2) 1 
Thus : S(v,-v,P=20, S(y,- ¥)= 9, 
and P7429; te= "143; 
Whenee : r from p,='45 + ‘23, 


r from R=°25+ <-23. 
The actual value of 7 from product-moment is 
| r='63+°'17. 


This example serves to show that the correlation found from R may when the 
observations are few, not be definitely significant, while when we proceed in the more 
accurate manner it is definitely significant. The R-method is thus shown not to 
have special advantages, but rather peculiar disadvantages for short series. Its merit 
really lies in rapidity of working for assay purposes and rough treatment. 


(11) Lllustration V. Resemblance of Cousins. 
(a) Waedth of Hand. The following table gives the width ofthe hand in 


34 pairs of male adult cousins taken from my series of Cousin Measurements, These 
data are being used by Miss Ethel M. Elderton in a forthcoming paper on this 
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subject, and I have most heartily to thank her for the exhaustive manner in which 
she has dealt with the material-in order to illustrate the whole subject of deter- 
mining correlation by ranks. | 


Taste VI. Width of Hand in. mm. in Pairs of Male Adult Cousins. 


[st cousin x/2ndcousiny || Rank A | RankB | »,-7 (v, > v5)? True Grade of A | True Grade of B 
80:7 80-0 230 Lieb he 2a, 6 36 23°51 20-74) 20-74" 23-1 
90-0 80-0 58 17;17 58 || 41 1681 58:82 20°74 | 20.74 58°82 
80:7 84°7 23 48 | 48 23 — 25 625 23°51 40°66 40°66 23°51 
90-0 84°7 58 48 | 48 58 | 10 100 58°82 40°66 | 40°66 58-82 
80:0 84:7 1% 48 | 48 17 —3l1 961 20°74 40:66 40°66 20-74 
74:5 81:0 3-26) 26°.‘3 — 23 529 5-52 24:74 | 24-74 - 5:52 
81:0 80:0 26 apt Li 726 9 81 24-74 2074 | 2074 24:74 
86-0 81:0 52 26] 26 52 || 26 676 46°00 24°74 | 24-74 46-00 
80:7 83°7 23 43 | 43. 23 — 20 400 23°51 36°36 | 36°36 23°51 
94:0 82-7 64 37 | 37 64 || 27 729 65:26 31:99 | 31°99 65:26 
94:0 817 64 34) 34 64 | 30 900 65°26 27°68 | 27°68 65-26 
76-0 ico Opell 5 — 6 36 844 10°90 } 10:90 8-44 
“760 79-0 i a a a —ll Li 844 17°08 17:08 8-44 
76:0 83:0 SAL 41.1.5 — 36 1296 844 33:29 | 33:29 844 
86:3 88:3 53 54 | 54 53 — 1 1 47-16 54:16 | 54°16 47°16 
92°5. 85:0 60 51) 51 60 9 81 63°51 41°94 | 41:94 63°51 
83:7 81-7 43 34 | 34 43 9 81 36:36 27-68 | 27-68 36°36 
83°7 83°3 43 42) 42 43 i 1 36°36 3462 3462 36°36 
83:7 78:7 43. 15 | 15 43 || 28 784 36:36 16:05 16°05 36°36 
82:0 81:0 36 26 | 26 36 || 10 100 28°95 24:74 | 24-74 28°95 
80 5 80:0 22 Ade| 17 22 5 25 22-71 © 20-74 | 20°74 22-71" | 
75-0 76:0 SO ye be 4 — | 1 610 844, 8-44 6°40 
71:0 76:0 ie Gee Dee E — 4 16 170. 844 844 1°70 
73°0 77-0 SEA Te <2 — 9 81 345 8-44 844 3°45 
84:5 78-0 47 13) 13 47 | 34 1156 39°81 13°78 | 13°78 39°81 | 
76:0 78:0 dea 5 Saal Bs Saas — 8 64 844 13°78 | 13°78 8-44 
93-3 89-7 61 55 | 55 61 6 36 64:53 58:09 | 58:09 64-53 
93°3 82:7 OP 37 at “Ol eae 576 64:53 31°99 | 31:99 64°53 
98°7 82°7 66 37 | 37 66 || 29 841 67°58 31°99 | 31:99 67°58 
89-7 81-0 5D.-26 | 26 55 || 29 841 58-09 24:74 | 24:74 58-09 
81-0 93°3 20-660 | 61 26 —35 1225 24:74 64°53 | 64:53 24-74 
82-7 81-0 BACs PAS PAG er | eee! 121 31:99 24-74 | 24-74 31-99 
98°7 81-0 66 26 | 26 66 || 40 1600 67:58 24:74 | 24-74 67°58 
98°7 89-7 66°55 | 55 - 66- |) 11 121 67°58 58-09 | 58:09 67°58 
Mean) _ go. Mean) a, S(v,—ve) | S(4—v%)? || Mean) «35; 

Size \=83 a Rank }=34 365 |= 15bo9 | Grade} jhe 


The measurements were only read to the millimetre, but since measurements were 
’ taken two or three times in each case the fractions ‘3, °5 or 7 arise, when averaging, 
Since either cousin may be the ‘first ” cousin, we have for a symmetrical table 68 pairs. 
In the third and fourth columns, we have the ranks placed, according as to which 
It will at once be obvious that many ties arise ; 


cousin is considered the “ first.” 


thus no less than eight individuals tie with a width of hand 81mm. at rank 26. It 
They run from 26 to 33, 
We shall speak of this as the mid-rank method. Or, we 


is not so clear what rank ought to be given to them. 
we may call them all 29°5. 


5—2 
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might put them all at 26, because this would probably be the result nearest to the 
true grade*. We shall speak of this as the bracket-rank method +, 

The above table illustrates the work for the bracket-rank method in columns 
5 and 6, the differences of ranks A and B being, however, only written down once, so 
that to find S(»,—»,), we must sum all quantities in the fifth column as if they had 
the same sign, and double the sum of their squares in the sixth column. 


We find : R='2148 and p,.='3922, 
whence r from p,y="408+ ‘072, 
r from R =°361+ > :072. 


If we now investigate the value of R and p,, from the mid-ranks, we find that 
S(v,—v,) = 588 and S (v,—v,)? = 29812. Accordingly : 
A='2369, and p,=*4310, 
Whence : r from p,="448+ °069, 
r from R='396+ >-069. 


Both these values for rv are higher than those determined by the bracket-rank 
process. We must then question whether the mid-rank or the bracket-rank method 
is the better. Or, indeed is it not possible, that sometimes the one, and sometimes 
the other will be the closer according to the nature of the frequency distribution 2 

To illustrate this point the actual grades on the basis of normal distribution have 
been calculated by Eqn. (xii). It must be remembered that 5 has to be added to 
the grade to obtain the rank, Eqn. (xiii). 

We find : Mean width of hand = 83°16 mm. 

Standard Deviation = 6:201 mm. 


As illustration of the method consider the hand of width 84:7 mm., its deviation 
is 1°54 and the ratio of this to the 8. D. =-248, this corresponds to a value of $(1+a), 
in the notation of Sheppard’s Tables, ="59793 and multiplied by 68 gives the grade 
40°66, corresponding to a rank 41°16, as against the observed rank 48 or a mid-rank 
49! Thus the actual size of organ corresponding to a bracket rank may differ widely 
from the size really belonging to the ranked organ, or the true grade in a general 
population differ very considerably from the spurious grade or rank in the sample 
used. This point again indicates how little can be judged from ranks unless we 
associate the rank distribution with some frequency hypothesis, 

Having found the true grades we may correlate them together to find Px, but in 
using the formula 


Se SAG das 
Pot bit ZIV Xoo 


* That is, find o, and calculate g, and g, from Eqn. (xii) p. 10; the true grade in this case is 24°74, and 
¥,;=9, + 5 = 25°24 is even below 26, not above it. 

} To adopt a term from the examination world, where the place number of the bracket is measured 
only by those above, « 
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we may adopt either the theoretical value ~ N? for o,, or we can actually calculate 
its value. Now 74,N?=3854 and o,?=365'94, and thus there is a very considerable 
deviation from normality in the series* S (g,— g,)’ =31153°195, and thus: 


Py» found from the true o,’=°3740, 
P» found from of=7;N* =°'4055. 
Whence : r from true o,’=°3890, 


yr from of =p, = 4215. 


If we might judge from this single case we should conclude that the bracket-rank 
method gave a closer result to the grade method than the mid-rank method. But the 
question now arises, how close after all are all these grade rank methods to the corre- 
lation coefficient in any short series such as the present ? 


Accordingly the series was worked out by product moment and the result obtained 


was 
r= ‘331+ *073. 


Thus we see that the actual correlation is considerably lower than that given 
by any of the rank or grade processes. It is perfectly true that °33 and ‘45 are 
within double the probable error, and therefore two different random samples of the 
real population might have given as widely divergent results. But this is really the 
case of two different methods applied to the same sample. And further the actual 
correlation tells us that as far as this sample is concerned the true answer is likely to 
lie between ‘19 and °48, but the mid-rank method tells us that it is likely to lie 
between °31 and ‘58t. Now it is clear we might for some extraneous reason hold the 
value likely to be ‘56, and we should find nothing to contradict this in the mid-rank 
result. But the proper method of determining 7 would show us that such a value 
was itself very unlikely. Thus the latter method when it diverges less than twice 
the probable error from the result of the rank method may yet forbid us to interpret 
the results in a manner admissible on the rank method. We cannot argue in like 
manner from the grade or rank result because that method has assumed an hypothesis, 
not made in the product-moment treatment, i.e. that of normal correlation, which is 
here not justified by the results. 

But even the amount of agreement here oteatt is to be considered rather excep- 
tional. I owe to Miss Elderton the working out of three other pairs of characters in 
the same set of male cousins each in five different ways. I have myself done each 
of them in three more ways, namely by Variate Differences as in Art. 2, and by the & 
method. The results are given in the Table below. 


* The mean grade in fact = 32°41 and not 34 also, 
+ Taking a range of twice the probable error on either side the means. 


38 KARL PEARSON 


Taste VII. Comparison of Correlation Coefficients Sound by Various Methods. 
Resemblance of Hand in 68 Pairs of Male Cousins. 


Grades Bea | 
. Product Variate | | 44s 7 . 
Character tera: Dincna eee | Be nenk Mid-Rank 
True og} o°=7,N?| Br | By R BF ps By R | 
| Width of Hand | “33+ 07 37 rae) |! oly 414+:°07 | 36 | -45+-07 ‘40 
Width of Wrist | 17 +-08 25 pL el an 07+ °085 | 05 | -08+-085 | -03 
_ Length of Index Finger | +19 + 08 “14 iG: =i), SEs} 214-08 | -29 oko O85e 29 
Length of Little Finger | -29 + -075 "26 18 30 204-08 | 19 } 334+-08 | -21 
| = 2458 | 
Mean of Four Results 25 “25 Peril “AAS 22 | ey 24 a2 
HeCHapa aust Devienen 053 | -069 | -060 079 |-094] -o79 | -096 
rom true 7 | 


Tt will, I think, be clear from this table that for series even with as many as 
68 pairs—and this is approaching the limit at which any time is gained by using 
rank methods—we cannot hope to ascertain the correlation of the sample by such 
methods within about *1 of its value, and as the probable error of the sample may be 
‘07, we may well deviate ‘2 from the population value in our estimate. We are 
accordingly very unlikely to reach reliable results by rank methods for the 8 to 10 
observations to which Dr Spearman proposes to apply his R-method. We see that 
the mean values are fairly close, although the variate difference and the second grade | 
methods give the best results. Judged by mean square deviations from product 
moment results, the variate difference is easily first, then come the laborious grade 
methods, the rank methods by p, about fifty per cent. worse than the variate 
difference, and lastly the R methods not quite 100 per cent. worse. Thus we note 
that when a series is not fairly long and not approximately normal, the different rank 
and grade methods will give very diverse results. But when a series is fairly long, 
say 100 or more observations, then there is no advantage in rapidity from the rank 
method ; the formation of a grouped correlation table, and the use of the product 
moment is just as rapid, and further conveys a great deal more of valuable information. 


(12) Conclusions. Three new methods of determining variate correlation have 
been given in this paper. The first, that of variate differences, seems likely to be of 
some service in the case of symmetrical tables containing large numbers, the frequency 
being approximately normal, homotyposis tables may be taken as illustration. 

The second that of deducing variate correlation from correlation of ranks, may be 
of service when it is not possible to put a quantitative value on the individual 
character. Thus it might be easy to form a relative series of intensity of pigment, 
and place individuals in rank. But mere correlation of ranks is not in itself a com- 
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parable character, as the variate correlation may have widely different values for the 
same ranking. Justification for the comparability depends upon assuming a wide 
spread rule of frequency distribution, and this rule can hardly be other than normality. 
The present paper shows how to deduce variate correlation from correlation of ranks. 
It. shows, however, that such a method of reaching variate correlation is considerably 
less exact than the usual product-moment method. ‘There is no gain in accuracy, but 
the reverse in using such a method in the case of short series. 

Thirdly, the method proposed by Spearman of deducing the correlation of ranks 
from the positive differences of ranks is discussed, and the error of the process by 
which he has deduced for it an accuracy greater than that of the more usual methods 
of finding correlation is indicated. A method for deducing variate correlation from 
positive difference of ranks is indicated. The method is very rapid for short series, 
say those not exceeding 20 observations, but it is less accurate than the product- 
moment method, and considerable changes in the final value reached will be found to 
arise according as we use bracket-ranks or mid-ranks in the case of ties. The 
comparison with true grades for a few special cases, does not enable us to say which 
is the better method ; the deviations from normality sometimes appear to make one, 
sometimes the other, the closer to the true correlation. 

In conclusion, I think, we may say that variate correlation found by ranks may 
prove to be a useful auxiliary method of dealing with correlation, when it is needful 
to give a rough answer to a problem in a brief time, or when the material itself is 
incapable of being accurately measured. In all such cases mean square of rank 
differences will be more accurate than mean positive rank difference. But both 
methods must be used with caution, and their easy application must not lead us to 
approve exaggerated statements as to their accuracy. 
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